30 research outputs found

    Bio-inspired speed detection and discrimination

    Get PDF
    In the field of computer vision, a crucial task is the detection of motion (also called optical flow extraction). This operation allows analysis such as 3D reconstruction, feature tracking, time-to-collision and novelty detection among others. Most of the optical flow extraction techniques work within a finite range of speeds. Usually, the range of detection is extended towards higher speeds by combining some multiscale information in a serial architecture. This serial multi-scale approach suffers from the problem of error propagation related to the number of scales used in the algorithm. On the other hand, biological experiments show that human motion perception seems to follow a parallel multiscale scheme. In this work we present a bio-inspired parallel architecture to perform detection of motion, providing a wide range of operation and avoiding error propagation associated with the serial architecture. To test our algorithm, we perform relative error comparisons between both classical and proposed techniques, showing that the parallel architecture is able to achieve motion detection with results similar to the serial approach

    Combination of Standard and Complementary Models for Audio-Visual Speech Recognition

    Get PDF
    In this work, new multi-classifier schemes for isolated word speech recognition based on the combination of standard Hidden Markov Models (HMMs) and Complementary Gaussian Mixture Models (CGMMs) are proposed. Typically, in speech recognition systems, each word or phoneme in the vocabulary is represented by a model trained with samples of each particular class. The recognition is then performed by computing which model best represents the input word/phoneme to be classified. In this paper, a novel classification strategy based on complementary class models is presented. A complementary model to a particular class j refers to a model that is trained with instances of all the considered classes, excepting the ones associated to that class j. The classification schemes proposed in this paper are evaluated over two audio-visual speech databases, considering acoustic noisy conditions. Experimental results show that improvements in the recognition rates through a wide range of signal to noise ratios (SNRs) are achieved with the proposed classification methodologies.Sociedad Argentina de Informática e Investigación Operativa (SADIO

    Prototype Robot for Computer Vision and Control Systems Applications

    Get PDF
    This paper describes a robot designed and developed by a student in the context of an Electronic Engineering degree course. This robot is composed by three wheels, two of them can be controlled inde- pendently and the third one is used for stability. The robot also includes a webcam provided with pan and tilt control. This work was focused on the implementation of a prototype useful for academic research in the areas of Computer Vision and Control Systems Dynamics. In this document, the main characteristics of this robot are described.Sociedad Argentina de Informática e Investigación Operativa (SADIO

    Combination of Standard and Complementary Models for Audio-Visual Speech Recognition

    Get PDF
    In this work, new multi-classifier schemes for isolated word speech recognition based on the combination of standard Hidden Markov Models (HMMs) and Complementary Gaussian Mixture Models (CGMMs) are proposed. Typically, in speech recognition systems, each word or phoneme in the vocabulary is represented by a model trained with samples of each particular class. The recognition is then performed by computing which model best represents the input word/phoneme to be classified. In this paper, a novel classification strategy based on complementary class models is presented. A complementary model to a particular class j refers to a model that is trained with instances of all the considered classes, excepting the ones associated to that class j. The classification schemes proposed in this paper are evaluated over two audio-visual speech databases, considering acoustic noisy conditions. Experimental results show that improvements in the recognition rates through a wide range of signal to noise ratios (SNRs) are achieved with the proposed classification methodologies.Sociedad Argentina de Informática e Investigación Operativa (SADIO

    Identification and characterization of crops through the analysis of spectral data with machine learning algorithms

    Get PDF
    This paper assesses the capability of an spectrometer used in field experiments of soybean, maize and wheat. The objective of this work is to select different wavelengths intervals of the spectral reflectance curve, within the range 632-1125 nm, as features for classification using machine learning methods. Two different classifications are presented, species selection and growth stage identification. For species classification accuracy of 92% is reached, while 99% is obtained for stage classification. In addition we propose a new index that outperforms analyzed established vegetation indices, which shows the potential advantage of using this type of devices.Sociedad Argentina de Informática e Investigación Operativ

    Identification and characterization of crops through the analysis of spectral data with machine learning algorithms

    Get PDF
    This paper assesses the capability of an spectrometer used in field experiments of soybean, maize and wheat. The objective of this work is to select different wavelengths intervals of the spectral reflectance curve, within the range 632-1125 nm, as features for classification using machine learning methods. Two different classifications are presented, species selection and growth stage identification. For species classification accuracy of 92% is reached, while 99% is obtained for stage classification. In addition we propose a new index that outperforms analyzed established vegetation indices, which shows the potential advantage of using this type of devices.Sociedad Argentina de Informática e Investigación Operativ

    Identification and characterization of crops through the analysis of spectral data with machine learning algorithms

    Get PDF
    This paper assesses the capability of an spectrometer used in field experiments of soybean, maize and wheat. The objective of this work is to select different wavelengths intervals of the spectral reflectance curve, within the range 632-1125 nm, as features for classification using machine learning methods. Two different classifications are presented, species selection and growth stage identification. For species classification accuracy of 92% is reached, while 99% is obtained for stage classification. In addition we propose a new index that outperforms analyzed established vegetation indices, which shows the potential advantage of using this type of devices.Sociedad Argentina de Informática e Investigación Operativ

    Combination of Standard and Complementary Models for Audio-Visual Speech Recognition

    Get PDF
    In this work, new multi-classifier schemes for isolated word speech recognition based on the combination of standard Hidden Markov Models (HMMs) and Complementary Gaussian Mixture Models (CGMMs) are proposed. Typically, in speech recognition systems, each word or phoneme in the vocabulary is represented by a model trained with samples of each particular class. The recognition is then performed by computing which model best represents the input word/phoneme to be classified. In this paper, a novel classification strategy based on complementary class models is presented. A complementary model to a particular class j refers to a model that is trained with instances of all the considered classes, excepting the ones associated to that class j. The classification schemes proposed in this paper are evaluated over two audio-visual speech databases, considering acoustic noisy conditions. Experimental results show that improvements in the recognition rates through a wide range of signal to noise ratios (SNRs) are achieved with the proposed classification methodologies.Sociedad Argentina de Informática e Investigación Operativa (SADIO

    Isolated spanish digit recognition based on audio-visual features

    Get PDF
    The performance of classical speech recognition techniques based on audio features is degraded in noisy environments. The inclu-sion of visual features related to mouth movements into the recogni-tion process improves the performance of the system. This paper proposes an isolated word speech recognition system based on audio-visual features. The proposed system combines three classifiers based on au-dio, visual and audio-visual information, respectively. An audio-visual database composed by the utterances of the digits (in Spanish language) is employed to test the proposed system. The experimental results show a significant improvement on the recognition rates through a wide range of signal-to-noise ratios.IV Workshop procesamiento de señales y sistemas de tiempo real.Red de Universidades con Carreras en Informática (RedUNCI

    Generic Face Animation

    Get PDF
    International audienceIn computer vision, the animation of objects has attracted a lot attention, specially the animations of 3D face models. The animation of face models requires in general to manually adapt each generic movement (open/close mouth) to each specific head geometry. In this work we propose a technique for the animation of any face model avoiding most of the manual intervention. In order to achieve this we assume that: (1) faces, despite obvious differences are quite similar and a single generic model can be used to simplify deformations and (2) using this face model, a simple interpolation technique can be used, with minimal manual intervention. Several examples are presented to verify the realism of the obtained animations
    corecore